Sed 4415 optimize reporting request and query performance#566
david-stephan wants to merge 10 commits
Conversation
This reverts commit eb71b1f.
...ntroller/step-controller-server/src/main/java/step/core/controller/StepControllerPlugin.java
AsyncTaskManager asyncTaskManager = context.require(AsyncTaskManager.class);
asyncTaskManager.scheduleAsyncTask((empty) -> {
    logger.info("ReportNode timeSeries ingestion for empty resolutions has started");
    reportNodeTimeSeries.getTimeSeries().ingestDataForEmptyCollections();
- I couldn't find where this was called previously. Wasn't it called before?
- Is it safe to do this asynchronously? I assume the controller would start and executions could begin creating data at the same time?
We have the same logic implemented for the time-series (response times), but it did not exist for the reportNodeTimeSeries.
Re-ingesting such new resolutions can take quite some time, and I would say doing it asynchronously is the only option. Since the empty resolutions are determined before any new execution can be triggered, and a dedicated ingestion pipeline is used, it should be safe. However, the filter currently used to re-ingest data is too permissive and could cause "new" buckets (created while re-ingesting) to be re-ingested. I will update the filter to make sure begin < controller_start_time (in ingestDataForEmptyCollections).
Filter filter = (Filter) (collection.getTtl() > 0L
        ? Filters.gte("begin", System.currentTimeMillis() - collection.getTtl())
        : Filters.empty());
try (Stream<Bucket> bucketStream = previousCollection.findLazy(filter, searchOrder)) {
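The fix discussed above can be sketched as a pair of bounds on the `begin` field: the existing TTL lower bound plus a new upper bound at the controller start time, so buckets ingested after startup are never re-ingested. This is a minimal illustration only; the method and parameter names below are assumptions, not the actual implementation in the PR.

```java
// Hypothetical sketch of the tightened re-ingestion window.
// Actual code would build Filters.and(Filters.gte(...), Filters.lt(...)).
public class ReingestionFilter {

    // Returns {lowerBound, upperBound} in epoch millis for the 'begin' field:
    // lower = now - ttl (or unbounded if no TTL), upper = controller start time.
    static long[] bounds(long nowMillis, long ttlMillis, long controllerStartMillis) {
        long lower = ttlMillis > 0 ? nowMillis - ttlMillis : Long.MIN_VALUE;
        return new long[]{lower, controllerStartMillis};
    }

    public static void main(String[] args) {
        long[] b = bounds(1_000_000L, 600_000L, 950_000L);
        // Only buckets with 400000 <= begin < 950000 are re-ingested.
        System.out.println(b[0] + ".." + b[1]); // 400000..950000
    }
}
```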
Thanks for the clarification. I agree that it should be done asynchronously. There are, however, a few things we should think about:
- Ensure the re-ingestion is not interrupted: we could explicitly state in the logs that the controller shouldn't be restarted during the re-ingestion, to avoid incomplete re-ingestions.
- Ensure new data is not re-ingested: I'm not sure the condition begin < controller_start_time would be enough for large resolutions. In theory, the last bucket could fulfill this condition and still be used by new executions while it is being re-ingested. Right?
- In any case, we should benchmark the re-ingestion for large data sets.
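One way to address the last-bucket concern raised above is to round the cutoff down to the previous resolution boundary, so the bucket that new executions may still be filling falls entirely outside the re-ingestion window. This is a sketch of that idea, not code from the PR; whether bucket boundaries are actually aligned this way in the time-series implementation is an assumption.

```java
// Hypothetical sketch: align the re-ingestion cutoff to a bucket boundary.
public class CutoffAlignment {

    // Rounds cutoffMillis down to the nearest multiple of resolutionMillis,
    // excluding the in-progress bucket from the re-ingestion window.
    static long alignDown(long cutoffMillis, long resolutionMillis) {
        return (cutoffMillis / resolutionMillis) * resolutionMillis;
    }

    public static void main(String[] args) {
        // With a 1h resolution, a controller start time mid-bucket is rounded
        // down so the current (possibly still written) bucket is excluded.
        System.out.println(alignDown(3_700_000L, 3_600_000L)); // 3600000
    }
}
```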
@Override
public void runUpgradeScript() {
    log.info("Renaming time-series 'main' collections to match their resolutions");
I was about to write that we should have only one main collection, and only later understood that we have two time series. We might explicitly list the two collection names.
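Making the log entry enumerate both affected collections could look like the sketch below. The collection names used here are placeholders invented for illustration; the real names in the PR may differ.

```java
import java.util.Map;

// Hypothetical sketch: log each rename explicitly instead of a generic message.
public class MainCollectionRenamer {

    // Placeholder old -> new names; the PR's actual collection names may differ.
    static final Map<String, String> RENAMES = Map.of(
            "timeseries", "timeseries_minute",
            "reportNodeTimeSeries", "reportNodeTimeSeries_minute");

    public static void main(String[] args) {
        RENAMES.forEach((oldName, newName) ->
                System.out.println("Renaming time-series collection '"
                        + oldName + "' to '" + newName + "'"));
    }
}
```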
private void updateSettingKeyIfPresent(String oldKey, String newKey) {
    Optional<Document> setting = settings.find(Filters.equals("key", oldKey), null, null, null, 0).findFirst();
    setting.ifPresent(s -> {
        s.put("key", newKey);
I would add an explicit log entry here to be fully transparent.
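The suggested log entry could be added right before the key is rewritten, as in this simplified sketch. The `Document`/`settings` types from the PR are replaced here by a plain `Optional<String>` for illustration; only the logging pattern is the point.

```java
import java.util.Optional;

// Hypothetical sketch: migrate a setting key and log the change explicitly.
public class SettingsMigration {

    static Optional<String> updateSettingKeyIfPresent(Optional<String> setting,
                                                      String oldKey, String newKey) {
        // Explicit trace so the migration is fully transparent in the logs.
        setting.ifPresent(s ->
                System.out.println("Migrating setting key '" + oldKey
                        + "' to '" + newKey + "'"));
        return setting.map(s -> newKey);
    }

    public static void main(String[] args) {
        updateSettingKeyIfPresent(Optional.of("value"), "old.key", "new.key");
    }
}
```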
...ep-plans-core/src/main/java/step/core/artefacts/reports/aggregated/ReportNodeTimeSeries.java
public enum Resolution {
    FIVE_SECONDS("5_seconds", Duration.ofSeconds(5), Duration.ofSeconds(1), false),
    FIFTEEN_SECONDS("15_seconds", Duration.ofSeconds(15), Duration.ofSeconds(5), false),
    ONE_MINUTE("minute", Duration.ofMinutes(1), Duration.ofSeconds(10), false),
If we rename the main collections, shouldn't we rename this one to 1_minute to be consistent?
I wanted to limit the migrations, but you're right
If this is too much of an effort (including FE), we can also do it in the next major.
    return enabledCollections;
}

public List<TimeSeriesCollection> getSingleTimeSeriesCollections(String mainCollectionName, TimeSeriesCollectionsSettings collectionsSettings, Duration resolution, Long flushInterval) {
Doesn't seem to be used
List<TimeSeriesCollection> enabledCollections = new ArrayList<>();
int flushSeriesQueueSize = collectionsSettings.getFlushSeriesQueueSize();
int flushAsyncQueueSize = collectionsSettings.getFlushAsyncQueueSize();
addIfEnabled(enabledCollections, mainCollectionName,
If for some reason the settings of a time series don't exist, the whole collection is dropped in addIfEnabled. Isn't this a bit risky? Should we foresee an explicit cleanup flag or something similar to avoid unexpected drops of the main collection?
I think I'll completely remove the drop when the collection is disabled. Admins can always manually drop the collection in the DB if required.
Exactly. I think it is safer
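The agreed-upon behaviour (skip a disabled resolution instead of dropping its collection) could look like this simplified sketch. The real `addIfEnabled` works with `TimeSeriesCollection` instances and settings objects; plain strings and a boolean flag stand in for them here.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: a disabled resolution is skipped (and logged),
// never dropped automatically. Manual cleanup stays an admin decision.
public class CollectionRegistry {

    static List<String> addIfEnabled(List<String> collections, String name, boolean enabled) {
        if (enabled) {
            collections.add(name);
        } else {
            System.out.println("Time-series collection '" + name
                    + "' is disabled; skipping it without dropping existing data");
        }
        return collections;
    }

    public static void main(String[] args) {
        List<String> cols = new ArrayList<>();
        addIfEnabled(cols, "main_minute", true);
        addIfEnabled(cols, "main_hour", false);
        System.out.println(cols); // [main_minute]
    }
}
```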
No description provided.